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Program Description 1 

Milieu teaching is a practice that involves manipulating or arranging 
stimuli in a preschool child’s natural environment to create a setting 
that encourages them to engage in a targeted behavior. For example, 
a teacher might place a desirable toy in a setting to encourage a 
student to request that toy (where requesting a toy is the desired 
target behavior). Typically, milieu teaching involves four strategies 
that a teacher will utilize to encourage a student to demonstrate a 
target behavior: modeling, mand-modeling, incidental teaching, and 
time-delay. Through adult modeling and functional consequences 
associated with child requests, targeted language behaviors can be 
improved in children who may have language delays or disabilities. 

Research 2 

One study of milieu teaching that falls within the scope of the Early 
Childhood Education Interventions for Children with Disabilities 
review protocol meets What Works Clearinghouse (WWC) evidence 
standards, and no studies meet WWC evidence standards with res- 
ervations. The one study included 40 preschool children with devel- 
opmental delays (eligible for this topic area) attending two schools in 
Davidson County, Tennessee. 

Based on this one study, the WWC considers the extent of evidence for milieu teaching on preschool children with 
disabilities to be small for communication/language competencies. Six other domains are not reported in this inter- 
vention report. (See the Effectiveness Summary for further description of all domains.) 

Effectiveness 

Milieu teaching was found to have no discernible effects on communication/language competencies for preschool 
children with disabilities. 



Table 1. Summary of findings 3 
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Program Information 

Background 

Milieu teaching was first described by Hart and Rogers-Warren (1978) as a collection of “naturalistic” instructional 
procedures that build on incidental teaching methods described by Hart and Risley (1975). 4 The milieu teaching 
practice does not have a single developer, though several groups of investigators have produced information and 
materials for this practice (see below). The one study that meets standards evaluated a milieu teaching program 
described by Kaiser, Hendrickson, and Alpert (1991) that was selected because it had a well-specified, written 
manual and was deemed a state-of-the-art language program (see p. 156 of Yoder, Kaiser, & Alpert, 1991). Read- 
ers interested in using milieu teaching practices in their classrooms can refer to sources available through Internet 
searches. A list of examples follows, although these sources have not been reviewed or endorsed by the WWC: 

• Augmentative and Alternative Communication (AAC) Connecting Young Kids (YAACK): http://aac.unl.edu/ 
yaack/d3.html 

• Research Autism: http://www.researchautism.net/autism_treatments_therapies_intervention. ikml?ra=91 

• Hart, B. M., & Risley, T. R. (1 975). Incidental teaching of language in the preschool. Journal of Applied Behavior 
Analysis, 8, 411-420. 

• Hart, B., & Rogers-Warren, A. (1978). A milieu approach to teaching language. In R. L. Schiefelbusch (Ed.), 
Language intervention strategies (pp. 193-235). Baltimore, MD: University Park Press. 

Program details 

Milieu teaching instruction often occurs in a routine environment known to the child. Instructional episodes often 
begin by following a child’s attentional lead. By manipulating or arranging stimuli in the child’s natural environment, 
or milieu, adults create a setting that encourages children to initiate interactions and offers opportunities for adults 
to use prompting and modeling of communication behaviors in a natural manner. In this environment, children initi- 
ate communication by pointing to or requesting desirable items or activities and receive positive consequences for 
communication targets when adults respond to their request, which is intended to encourage future communication. 

Four specific milieu teaching procedures are typically used to teach functional language to children with disabilities 
or developmental delays: modeling, mand-modeling, incidental teaching, and time-delay. Modeling refers to talking 
to the child and narrating activities, such as “Pick up the cup.” In mand-modeling, the teacher will encourage the 
child to request the target item verbally, by making requests (“mands”) such as “What do you want?” In incidental 
teaching, the teacher will follow a child’s attential lead and look expectantly at the child in an effort to encourage 
the child to request the target item. Finally, time-delay is a procedure whereby the adult will wait a period of time for 
a child’s response before attempting another prompt or teaching strategy. 

Cost 

Some published milieu teaching procedures are available free to the public. Information is not available about the 
costs of training for and implementation of milieu teaching. 
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Research Summary 

One hundred sixty-one studies reviewed by the WWC investigated 
the effects of milieu teaching on preschool children with disabilities. 

One study (Yoder, Kaiser, & Alpert, 1991), is a randomized controlled 
trial that meets WWC evidence standards. That one study is summa- 
rized in this report. No studies meet WWC evidence standards with 
reservations. The remaining 160 studies do not meet either WWC 
ligibility screens or evidence standards. (See references beginning 
on p. 5 for citations for all 161 studies.) 

Fourteen additional studies were reviewed against the pilot Single- 
Case Design standards. Two studies meet the pilot Single-Case 
Design standards, two studies meet the pilot Single-Case Design standards with reservations, and 10 did not meet 
pilot Single-Case Design standards. Studies reviewed against pilot Single-Case Design standards are listed in 
Appendix D and do not contribute to the intervention’s rating of effectiveness. 

Summary of study meeting WWC evidence standards without reservations 

Yoder et al. (1991) examined the effects of milieu teaching relative to a communication training program comparison 
group in a randomized controlled trial of 40 preschool students with developmental delays. This study took place in 
Davidson County, Tennessee. Approximately 60 ten-minute intervention sessions were conducted over the course 
of the study. This experiment utilized measurements on nine eligible outcomes, all in the communication/language 
competencies domain. 5 

Summary of studies meeting WWC evidence standards with reservations 

No studies of milieu teaching meet WWC evidence standards with reservations. 



Table 2. Scope of reviewed research 



Grade 


Prekindergarten 


Delivery method 


Small group 


Program type 


Practice 


Studies reviewed 


161 


Meets WWC standards 


1 study 


Meets WWC standards 
with reservations 


0 studies 
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Effectiveness Summary 

The WWC review of interventions for Early Childhood Education Interventions for Children with Disabilities 
addresses child outcomes in seven domains: cognitive development, communication/language competencies, 
literacy, math competencies, social-emotional development/behavior, functional abilities, and physical well-being. 
The one study that contributes to the effectiveness rating in this report covers one domain: communication/language 
competencies. The findings below present the authors’ estimates and WWC-calculated estimates of the size and 
the statistical significance of the effects of milieu teaching on preschool children with disabilities. For a more detailed 
description of the rating of effectiveness and extent of evidence criteria, see the WWC Rating Criteria on p. 26. 

Summary of effectiveness for the communication/language competencies domain 

One study reported findings in the communication/language competencies domain. 

Yoder et al. (1991) did not report any statistically significant findings on any of the nine eligible outcomes assessed 
in this study, and the WWC confirmed those calculations. According to WWC criteria, the one study that meets 
evidence standards demonstrated an indeterminate effect of milieu teaching for the communication/language 
competencies domain. 

Thus, for the communication/language competencies domain, one study showed an indeterminate effect. This results 
in a rating of no discernible effects, with a small extent of evidence. 



Table 3. Rating of effectiveness and extent of evidence for the communication/language competencies domain 



Rating of effectiveness 


Criteria met 


No discernible effects 


The review of milieu teaching in the communication/language competencies domain had one study showing an 


No affirmative evidence of effects. 


indeterminate effect. 


Extent of evidence 


Criteria met 


Small 


The review of milieu teaching in the communication/language competencies domain was based on one study that 
included two schools and 40 students. 
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348-349. The study is ineligible for review because it does not use a sample aligned with the protocol— the sam- 
ple either includes less than 50% students with identified disabilities or more than 50% students with autism. 

Trent, J. A., Kaiser, A. P, & Wolery, M. (2005). The use of responsive interaction strategies by siblings. Topics in 

Early Childhood Special Education, 25(2), 107-118. The study is ineligible for review because it does not use a 
sample aligned with the protocol— the sample is not within the specified age or grade range. 

Van der Meer, L. A. J., & Rispoli, M. (2010). Communication interventions involving speech-generating devices for 
children with autism: A review of the literature. Developmental Neurorehabilitation, 13(4), 294-306. The study 
is ineligible for review because it is a secondary analysis of the effectiveness of an intervention, such as a 
meta-analysis or research literature review. 

Van Kleeck, A., Schwarz, A. L, Fey, M., Kaiser, A., Miller, J., & Weitzman, E. (2010). Should we use telegraphic or 

grammatical input in the early stages of language development with children who have language impairments? A 
meta-analysis of the research and expert opinion. American Journal of Speech-Language Pathology, 79(1), 3-21 . 
The study is ineligible for review because it is a secondary analysis of the effectiveness of an intervention, such 
as a meta-analysis or research literature review. 

Vismara, L. A., & Rogers, S. J. (2010). Behavioral treatments in autism spectrum disorder: What do we know? 

Annual Review of Clinical Psychology, 6(1), 447-468. The study is ineligible for review because it is a second- 
ary analysis of the effectiveness of an intervention, such as a meta-analysis or research literature review. 

Vohr, B., St. Pierre, L., Topol, D., Jodoin-Krauzyk, J., Bloome, J., & Tucker, R. (2010). Association of maternal com- 
municative behavior with child vocabulary at 18-24 months for children with congenital hearing loss. Early 
Human Development, 86(4), 255-260. The study is ineligible for review because it does not use a sample 
aligned with the protocol— the sample is not within the specified age or grade range. 
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Wang, P. (2008). Effects of a parent training program on the interactive skills of parents of children with autism in 
China. Journal of Policy and Practice in Intellectual Disabilities, 5(2), 96-104. The study is ineligible for review 
because it does not use a sample aligned with the protocol— the sample either includes less than 50% stu- 
dents with identified disabilities or more than 50% students with autism. 

Warger, C. (1999). Early childhood instruction in the natural environment (ERIC EC Digest No. E591). Reston, VA: 
ERIC Clearinghouse on Disabilities and Gifted Education. (ERIC No. EDO-99-18). The study is ineligible for 
review because it is a secondary analysis of the effectiveness of an intervention, such as a meta-analysis or 
research literature review. 

Warren, S. F., & Brady, N. C. (2007). The role of maternal responsivity in the development of children with intellectual 
disabilities. Mental Retardation and Developmental Disabilities Research Reviews, 73(4), 330-338. The study is 
ineligible for review because it is a secondary analysis of the effectiveness of an intervention, such as a meta- 
analysis or research literature review. 

Warren, S. F., Fey, M. E., Finestack, L. H., Brady, N. C., Bredin-Oja, S. L., & Fleming, K. K. (2008). A randomized trial 
of longitudinal effects of low-intensity responsivity education/prelinguistic milieu teaching. Journal of Speech, 
Language, and Hearing Research, 51(2), 451-470. The study is ineligible for review because it does not use a 
sample aligned with the protocol— the sample is not within the specified age or grade range. 

Warren, S. F., & Reichle, J. (1992). Causes and effects in communication and language intervention. Baltimore, MD: 
Paul H. Brookes. The study is ineligible for review because it is a secondary analysis of the effectiveness of an 
intervention, such as a meta-analysis or research literature review. 

Warren, S. F., & Yoder, P. J. (1996). Enhancing communication and language development in young children with 
developmental delays and disorders. Peabody Journal of Education, 77(4), 118-132. The study is ineligible for 
review because it is a secondary analysis of the effectiveness of an intervention, such as a meta-analysis or 
research literature review. 

Warren, S. F., Yoder, P. J., Gazdag, G. E., & Kim, K. (1993). Facilitating prelinguistic communication skills in young 
children with developmental delay. Journal of Speech and Hearing Research, 36(1), 83-97. The study is 
ineligible for review because it does not use a sample aligned with the protocol— the sample is not within the 
specified age or grade range. 

Watson, L. R., Lanter, E., McComish, C., & Poston Roy, V. (2004). Enhancing the communication development of 
toddlers with autism spectrum disorders. Journal of Cognitive and Behavioral Psychotherapies, 4(2), 179-201. 
The study is ineligible for review because it does not use a sample aligned with the protocol— the sample 
either includes less than 50% students with identified disabilities or more than 50% students with autism. 

Watson, T. S., Meeks, C., Dufrene, B., & Lindsay, C. (2002). Sibling thumb sucking: Effects of treatment for targeted 
and untargeted siblings. Behavior Modification, 26(3), 412-423. The study is ineligible for review because it 
does not use a sample aligned with the protocol— the sample either includes less than 50% students with 
identified disabilities or more than 50% students with autism. 

Wetherby, A. M., & Woods, J. (2008). Developmental approaches to treatment. In F. Volkmar, A. Klin, & K. Chawarska 
(Eds.), Autism spectrum disorders in infancy and early childhood (pp. 1 70-206). New York: Guilford. The study 
is ineligible for review because it does not use a sample aligned with the protocol— the sample either includes 
less than 50% students with identified disabilities or more than 50% students with autism. 

Wetherby, A. M., & Woods, J. J. (2006). Early social interaction project for children with autism spectrum disorders 
beginning in the second year of life: A preliminary study. Topics in Early Childhood Special Education, 26(2), 
67-82. The study is ineligible for review because it does not use a sample aligned with the protocol— the sam- 
ple either includes less than 50% students with identified disabilities or more than 50% students with autism. 

Williams, A. L., McLeod, S., & McCauley, R. J. (2010). Interventions for speech sound disorders in children. Baltimore, 
MD: Paul H. Brookes. The study is ineligible for review because it is a secondary analysis of the effectiveness 
of an intervention, such as a meta-analysis or research literature review. 
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Wolery, M., Barton, E. E., & Hine, J. F. (2005). Evolution of applied behavior analysis in the treatment of individuals 
with autism. Exceptionality, 73(1), 11-23. The study is ineligible for review because it is a secondary analysis 
of the effectiveness of an intervention, such as a meta-analysis or research literature review. 

Woods, J., Kashinath, S., & Goldstein, H. (2004). Effects of embedding caregiver-implemented teaching strategies 
in daily routines on children’s communication outcomes. Journal of Early Intervention, 26(3), 175-193. The 
study is ineligible for review because it does not use a sample aligned with the protocol— the sample is not 
within the specified age or grade range. 

Yoder, P., Kaiser, A., Goldstein, H., Alpert, C., Mousetis, L, Kaczmarek, L, & Fischer, R. (1995). An exploratory com- 
parison of milieu teaching and responsive interaction in classroom applications. Journal of Early Intervention, 
79(3), 218-242. The study is ineligible for review because it does not use a sample aligned with the protocol— the 
sample either includes less than 50% students with identified disabilities or more than 50% students with autism. 

Yoder, P., & Warren, S. (1999). Facilitating self-initiated proto-declaratives and proto-imperatives in prelinguistic 
children with developmental disabilities. Journal of Early Intervention, 22(4), 337-354. The study is ineligible 
for review because it does not use a sample aligned with the protocol— the sample is not within the specified 
age or grade range. 

Yoder, P. J., & Lieberman, R. G. (2010). Brief report: Randomized test of the efficacy of picture exchange communi- 
cation system on highly generalized picture exchanges in children with ASD. Journal of Autism and Develop- 
mental Disorders, 40(5), 629-632. The study is ineligible for review because it does not use a sample aligned 
with the protocol— the sample either includes less than 50% students with identified disabilities or more than 
50% students with autism. 

Yoder, P. J., & McDuffie, A. S. (2006). Treatment of responding to and initiating joint attention. In T. Charman & W. 
Stone (Eds.), Social and communication development in autism spectrum disorders: Early identification, diag- 
nosis, and intervention (pp. 117-142). New York: Guilford Press. The study is ineligible for review because it is a 
secondary analysis of the effectiveness of an intervention, such as a meta-analysis or research literature review. 

Yoder, P. J., & Stone, W. L. (2006). A randomized comparison of the effect of two prelinguistic communication 
interventions on the acquisition of spoken communication in preschoolers with ASD. Journal of Speech, 
Language, and Hearing Research, 49(4), 698-71 1 . The study is ineligible for review because it does not use a 
sample aligned with the protocol— the sample either includes less than 50% students with identified disabili- 
ties or more than 50% students with autism. 

Additional source: 

Yoder, P., & Stone, W. L. (2006). Randomized comparison of two communication interventions for preschool- 
ers with autism spectrum disorders. Journal of Consulting & Clinical Psychology, 74(3), 426-435. 

Yoder, P. J., & Warren, S. F. (1998). Maternal responsivity predicts the prelinguistic communication intervention that 
facilitates generalized intentional communication. Journal of Speech, Language, and Hearing Research, 41(5), 
1207-1219. The study is ineligible for review because it does not use a sample aligned with the protocol— the 
sample is not within the specified age or grade range. 

Yoder, P. J., & Warren, S. F. (2001). Relative treatment effects of two prelinguistic communication interventions on 
language development in toddlers with developmental delays vary by maternal characteristics. Journal of 
Speech, Language, and Hearing Research, 44(1), 224-237. The study is ineligible for review because it does 
not use a sample aligned with the protocol— the sample is not within the specified age or grade range. 

Yoder, P. J., & Warren, S. F. (2002). Effects of prelinguistic milieu teaching and parent responsivity education on 
dyads involving children with intellectual disabilities. Journal of Speech, Language, and Hearing Research, 
45(6), 1 1 58-1 1 74. The study is ineligible for review because it does not use a sample aligned with the proto- 
col— the sample is not within the specified age or grade range. 

Yoder, P. J., & Warren, S. F. (2004). Early predictors of language in children with and without Down syndrome. 
American Journal on Mental Retardation, 109(4), 285-300. The study is ineligible for review because it does 
not examine the effectiveness of an intervention. 
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Yoder, P. J., Warren, S. F., Kim, K., & Gazdag, G. E. (1994). Facilitating prelinguistic communication skills in young 
children with developmental delay II: Systematic replication and extension. Journal of Speech and Hearing 
Research, 37(4), 841-851 . The study is ineligible for review because it does not use a sample aligned with the 
protocol— the sample is not within the specified age or grade range. 
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Appendix A: Research details for Yoder, Kaiser, & Alpert (1991) 

Yoder, P. J., Kaiser, A. P., & Alpert, C. L. (1991). An exploratory study of the interaction between language 
teaching methods and child characteristics. Journal of Speech and Hearing Research, 34(1), 155-167. 

Table A. Summary of findings Meets WWC evidence standards without reservations 



Study findings 
Average improvement index 

Outcome domain Sample size (percentile points) Statistically significant 



Communication/language 40 students +3 No 

competencies 



Setting 


The study took place in Davidson County, Tennessee. The language sessions were adminis- 
tered in the children’s classrooms or a nearby therapy room. 


Study sample 


Forty students with developmental delays participated in this study. The students were from 
two schools, a university-based preschool and a public school. The students from the univer- 
sity-based preschool had developmental delays of at least 20% of their chronological age in at 
least one developmental area on the Denver Developmental Screening Test. The students from 
the public school had scores more than one and one half standard deviations below the mean 
on four out of seven developmental areas. The 40 students were randomly assigned to two 
conditions: 20 received instruction with the milieu teaching method, and 20 received instruc- 
tion with the communication training program. 


Intervention 

group 


Milieu teaching is a naturalistic instruction method whereby the trainer follows the lead of the 
child in determining when to teach and what language form to elicit. In milieu teaching, the 
environment is arranged to include objects and activities that interest the child. Instructional 
strategies including incidental teaching and time-delay are utilized to encourage child com- 
munication. The goal of the milieu teaching method is that children learn to comprehend 
language structures from natural and informal adult modeling and active communication about 
the object or activity that is of interest to the child. When a child produces targeted language 
behavior during the activities, those utterances are consequated according to the child’s inter- 
est. For example, if a child requests a toy, giving the toy to the child serves as a functional 
consequence of the behavior. For students in both conditions, developmentally appropriate 
language targets were selected from the communication training program as the goals for the 
intervention. Children were separated into groups of two or three, and the sessions lasted 10 
minutes per child. The treatment lasted for 60 sessions. The first half of each session con- 
sisted of a group activity (games, making collages, etc.); in the second half of the session, 
children could choose toys of interest to them from the variety of toys available. 
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Comparison 

group 



Outcomes and 
measurement 



Support for 
implementation 



The comparison condition consisted of another treatment, called communication training. 
Communication training involves a more structured drill-and-practice approach than the natu- 
ralistic approach espoused by milieu teaching and uses a predetermined set of rules to select 
the trials and language targets. Comprehension of language targets was taught explicitly in 
this condition rather than implicitly in milieu teaching. The communication training group used 
consequences that included verbal feedback and tangible rewards to increase the likelihood of 
child response. For children with more significant disabilities, the rewards sometimes were not 
tied directly to a child’s utterance. The sessions were conducted for similar amounts of time 
and with the same numbers of students in both the intervention and comparison conditions. 

Nine eligible outcomes were assessed in this study, and all fall within the communication/ 
language competencies domain. For a more detailed description of these outcome measures, 
see Appendix B. 

All language teachers received 12 hours of group training. They also received individual and 
small-group training (ranging from two to ten hours depending on their entry skills, abilities in 
executing their assigned method, and the children’s specific needs). During the intervention 
period, the language teachers were observed on a weekly basis, and weekly group meetings 
were held to support teachers and solve training and behavioral management problems. 
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Appendix B: Outcome measures for each domain 



Communication/language competancies 


Intelligibility (percentage of utterances 
intelligible) 


Intelligibility was measured as the proportion of utterances that were intelligible (partially or completely) 
relative to the total number of utterances during a 60-minute speech sample (as cited in Yoder et al., 1991). 


Mean length of utterances 


Student syntactic level was measured as the mean length of utterances (in morphemes) as derived from 
the 60-minute speech sample (as cited in Yoder et al., 1991). 


Percentage of self -initiated utterances 


The extent to which the child used language in a spontaneous fashion was measured as the number of 
self-initiated intelligible utterances divided by the total number of utterances during the 60-minute speech 
sample (as cited in Yoder et al., 1991). 


Rate of different words (per minute) 


Vocabulary was measured as the number of different words uttered during the 60-minute speech sample 
divided by 60 to obtain a per-minute rate (as cited in Yoder et al., 1991). 


Rate of intelligibility utterances (per minute) 


The rate of talking was measured as the number of intelligible utterances per minute (as cited in Yoder et 
al„ 1991). 


Rate of utterances (per minute) 


The rate of talking also was measured as the total number of utterances (intelligible or not) during the 
60-minute speech sample divided by 60 to obtain the rate per minute (as cited in Yoder et al., 1991). 


SICD-E age 


The Sequenced Inventory of Communication Development-Expressive (SICD-E) score indicates general 
expressive language level in age-equivalent units (months) (as cited in Yoder et al., 1991). 


SICD-R age 


The Sequenced Inventory of Communication Development-Receptive (SICD-R) score indicates general 
receptive language level in age-equivalent units (months) (as cited in Yoder et al., 1991). 


Type token ratio 


Vocabulary also was measured using the type token, the number of unique words in the first 50 intelligible 
and complete utterances from the 60-minute speech sample (as cited in Yoder et al., 1991). 
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Appendix C: Findings included in the rating for the communication/language competencies domain 









Mean 

(standard deviation) 


WWC calculations 




Outcome measure 


Study 

sample 


Sample 

size 


Intervention 

group 


Comparison 

group 


Mean 

difference 


Effect 

size 


Improvement 

index 


p-value 


Yoder et al., 1991 a 


Milieu teaching vs. communication training program 


Intelligibility (percentage of 
utterances intelligible) 


Preschool 

students 


40 students 


0.9 (0.1) 


0.9 (0.1) 


0.0 


0.00 


0 


>0.05 


Mean length of utterances 


Preschool 

students 


40 students 


2.6 (1.1) 


2.5(11) 


0.1 


0.09 


+4 


>0.05 


Percentage of self -initiated 
utterances 


Preschool 

students 


40 students 


0.6 (0.1) 


0.4 (0.1) 


0.2 


0.48 


+18 


>0.05 


Rate of different words (per 
minute) 


Preschool 

students 


40 students 


3.3(1.2) 


3.2(17) 


0.1 


0.07 


+3 


>0.05 


Rate of intelligibility utterances 
(per minute) 


Preschool 

students 


40 students 


8.1 (3.0) 


8.1 (3.0) 


0.0 


0.00 


0 


>0.05 


Rate of utterances (per minute) 


Preschool 

students 


40 students 


9.1 (3.0) 


9.2 (3.1) 


-0.1 


-0.03 


-1 


>0.05 


SICD-E age 


Preschool 

students 


40 students 


31.2(7.8) 


30.8 (7.1) 


0.4 


0.05 


+2 


>0.05 


SICD-R age 


Preschool 

students 


40 students 


32.0 (5.2) 


31.6 (5.0) 


0.4 


0.08 


+3 


>0.05 


Type token ratio 


Preschool 

students 


40 students 


0.4 (0.1) 


0.4 (0.1) 


0.0 


0.00 


0 


>0.05 



Domain average for communication/language competencies (Yoder etal., 1991) 0.08 +3 Not 

statistically 

significant 



Domain average for communication/language competencies across ail studies 0.08 +3 na 



Table Notes: Positive results for mean difference, effect size, and improvement index favor the intervention group; negative results favor the comparison group. The effect size is 
a standardized measure of the effect of an intervention on student outcomes, representing the change (measured in standard deviations) in an average student's outcome that can 
be expected if the student is given the intervention. The improvement index is an alternate presentation of the effect size, reflecting the change in an average student's percentile 
rank that can be expected if the student is given the intervention. The WWC-computed average effect size is a simple average rounded to two decimal places; the average im- 
provement index is calculated from the average effect size. The statistical significance of the study’s domain average was determined by the WWC. SICD-E = Sequenced Inventory 
of Communication Development-Expressive. SICD-R = Sequenced Inventory of Communication Development-Receptive, na = not applicable. 

a For Yoder et al. (1991), a correction for multiple comparisons was needed, but this did not affect significance levels. The p-values presented here were reported in the original study. 
The WWC calculated the program group mean using a difference-in-differences approach (see the WWC Procedures and Standards Handbook, Appendix B) by adding the impact of 
the program (i.e., difference in mean gains between the intervention and comparison groups) to the unadjusted comparison group posttest means. For the intelligibility (percentage of 
utterances intelligible) and percentage of self-initiated utterances outcomes, the WWC calculated the effect size using the Cox effect size index, as these outcomes are dichotomous. 
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Appendix D: Single-case design studies reviewed for this intervention 



Study citation 


Study disposition 


Kaiser, A. P., & Hester, P. P. (1994). Generalized effects of enhanced milieu teaching. Journal of 
Speech and Hearing Research, 37(6), 1320-1340. 


Meets WWC pilot Single-Case Design standards. 


Warren, S., & Gazdag, G. (1990). Facilitating early language development with milieu intervention 
procedures. Journal of Early Intervention, 14(1), 62-86. 


Meets WWC pilot Single-Case Design standards. 


Hawkins, S. R., & Schuster, J. W. (2007). Using a mand-model procedure to teach preschool 
children initial speech sounds. Journal of Developmental and Physical Disabilities, 19( 1), 65-80. 


Meets WWC pilot Single-Case Design standards with 
reservations. 


Sheldon, K. M. (1997). Effects of a milieu teaching strategy in a storybook context on the acquisi- 
tion, maintenance, and generalization of expressive language by young children with development 
disabilities. Dissertation Abstracts International, 53(07A), 233-2602. 


Meets WWC pilot Single-Case Design standards with 
reservations. 


Kaczmarek, L. A., Hepting, N. H., & Dzubak, M. (1996). Examining the generalization of milieu 
language objectives in situations requiring listener preparatory behaviors. Topics in Early Childhood 
Special Education, 16(2), 139-167. 


Does not meet WWC pilot Single-Case Design stan- 
dards because it is an ABAB or multiple baseline design 
with less than three data points in a phase. 


Kaiser, A., & Whiteman, B. (1996). The generalized effects of training trainers to teach parents to 
implement milieu teaching. Journal of Early Intervention, 20(1), 30-51. 


Does not meet WWC pilot Single-Case Design stan- 
dards because it does not have at least three attempts 
to demonstrate an intervention effect at three different 
points in time. 


Kaiser, A. P. (1990), Toward a hybrid model of parent-implemented language intervention: Analysis 
of the effects of milieu and responsive-interaction teaching by parents. Unpublished paper pre- 
sented at the Annual Meeting of the American Association on Mental Retardation, Atlanta, GA. 


Does not meet WWC pilot Single-Case Design 
standards because inter-assessor agreement was not 
measured at least once in each phase and on at least 
20% of the data points in each condition. 


Kaiser, A. P, Hemmeter, M., Ostrosky, M., Alpert, C.,& Hancock, T. (1995). The effects of group 
training and individual feedback on parent use of milieu teaching. Journal of Childhood Communi- 
cation Disorders, 16(2), 39-48. 


Does not meet WWC pilot Single-Case Design stan- 
dards because it is an ABAB or multiple baseline design 
with less than three data points in a phase. 


Kaiser, A. P, Hester, P. P, Alpert, C. L., & Whiteman, B. C. (1995). Preparing parent trainers: 

An experimental analysis of effects on trainers, parents, and children. Topics in Early Childhood 
Special Education, 15(A), 385-414. 


Does not meet WWC pilot Single-Case Design stan- 
dards because it does not have at least three attempts 
to demonstrate an intervention effect at three different 
points in time. 


Kaiser, A. P, & National Institute of Child Health and Human Development (U.S.), (1994). Enhanced 
milieu teaching: An analysis of applications by interventionists and classroom teachers. Bethesda, 
MD: National Institute of Child Health and Human Development. 


Does not meet WWC pilot Single-Case Design stan- 
dards because it is an ABAB or multiple baseline design 
with less than three data points in a phase. 


Peterson, P (2000). The effects of teaching milieu language teaching skills to parents of children 
prenatally exposed to drugs. Dissertation Abstracts International, 61 ( 12B), 85-6739. 


Does not meet WWC pilot Single-Case Design stan- 
dards because it does not have at least three attempts 
to demonstrate an intervention effect at three different 
points in time. 


Peterson, R, Carta, J. J., & Greenwood, C. (2005). Teaching enhanced milieu language teaching 
skills to parents in multiple risk families. Journal of Early Intervention, 27(2), 94-109. 


Does not meet WWC pilot Single-Case Design stan- 
dards because it does not have at least three attempts 
to demonstrate an intervention effect at three different 
points in time. 


Warren, S. F. (1992). Facilitating basic vocabulary acquisition with milieu teaching procedures. 
Journal of Early Intervention, 16(3), 235-251. 


Does not meet WWC pilot Single-Case Design stan- 
dards because it does not have at least three attempts 
to demonstrate an intervention effect at three different 
points in time. 


Warren, S. F., & Bambara, L. M. (1989). An experimental analysis of milieu language intervention: 
Teaching the action-object form. Journal of Speech and Hearing Disorders, 54(3), 448-461. 


Does not meet WWC pilot Single-Case Design stan- 
dards because it does not have at least three attempts 
to demonstrate an intervention effect at three different 
points in time. 



Table Notes: The supplemental studies presented in this table do not factor into the determination of the intervention rating. 



Milieu Teaching April 201 2 



Page 24 





WWC Intervention Report 



Endnotes 

1 Milieu teaching does not have a single developer or official description. The descriptive information for this program was obtained 
from publicly available sources: descriptions of this practice (see the websites listed under Program Information) and research articles 
(i.e., Yoder, Kaiser, &Alpert, 1991; Yoder, Kaiser, Goldstein, &Alpert, 1995; Yoder, Camarata, & Gardner, 2005). Further verification of 
the accuracy of the descriptive information for this program is beyond the scope of this review. The literature search reflects docu- 
ments publicly available by May 201 1 . 

2 The studies in this report were reviewed using WWC Evidence Standards, Version 2.1, as described in the Early Childhood Educa- 
tion Interventions for Children with Disabilities review protocol Version 2.0. The evidence presented in this report is based on available 
research. Findings and conclusions may change as new research becomes available. 

3 For criteria used in the determination of the rating of effectiveness and extent of evidence, see the WWC Rating Criteria on p. 26. 
These improvement index numbers show the average and range of student-level improvement indices for all findings across the 
studies. The WWC review of interventions for the Early Childhood Education Interventions for Children with Disabilities topic area 
addresses student outcomes in seven domains: cognitive development, communication/language competencies, literacy, math 
competencies, social-emotional development/behavior, functional abilities, and physical well-being. Table 1 includes results only for 
communication/language competencies, as this was the only domain for which outcomes were assessed in the one study that meets 
evidence standards. 

4 Milieu teaching strategies have been elaborated and evolved, where expanded models have been labeled enhanced milieu teaching, 
and modifications for prelinguistic children are labeled as prelinguistic milieu teaching. Given that the focal article of this review utilizes 
standard milieu teaching, we do not describe these variants here. 

5 Five outcomes assessed in this article were determined to be ineligible and, therefore, are not included in this report. Three assess- 
ments were selected from the communication training program (i.e., the program provided to the comparison group) and were judged 
to be too overaligned to be eligible. Two other outcomes that were deemed ineligible were the raw scores from the Sequenced Inven- 
tory of Communication Development Expressive and Receptive language levels, which were divided by the children’s cognitive age. 
Because the raw scores (without the age standardization) were included in this report, the additional age-standardized outcome was 
considered a duplicated result and, therefore, ineligible. 

Recommended Citation 

U.S. Department of Education, Institute of Education Sciences, What Works Clearinghouse. (2012, April). Early 
Childhood Education Interventions for Children with Disabilities intervention report: Milieu teaching. Retrieved 
from http://whatworks.ed.gov. 
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WWC Rating Criteria 



Criteria used to determine the rating of a study 



Study rating 


Criteria 


Meets WWC evidence standards 
without reservations 


A study that provides strong evidence for an intervention’s effectiveness, such as a well-implemented RCT. 


Meets WWC evidence standards 
with reservations 


A study that provides weaker evidence for an intervention's effectiveness, such as a QED or an RCT with high 
attrition that has established equivalence of the analytic samples. 


Criteria used to determine the rating of effectiveness for an intervention 


Rating of effectiveness 


Criteria 


Positive effects 


Two or more studies show statistically significant positive effects, at least one of which met WWC evidence 
standards for a strong design, AND 

No studies show statistically significant or substantively important negative effects. 


Potentially positive effects 


At least one study shows a statistically significant or substantively important positive effect, AND 

No studies show a statistically significant or substantively important negative effect AND fewer or the same number 

of studies show indeterminate effects than show statistically significant or substantively important positive effects. 


Mixed effects 


At least one study shows a statistically significant or substantively important positive effect AND at least one study 
shows a statistically significant or substantively important negative effect, but no more such studies than the number 
showing a statistically significant or substantively important positive effect, OR 

At least one study shows a statistically significant or substantively important effect AND more studies show an 
indeterminate effect than show a statistically significant or substantively important effect. 


Potentially negative effects 


One study shows a statistically significant or substantively important negative effect and no studies show 
a statistically significant or substantively important positive effect, OR 

Two or more studies show statistically significant or substantively important negative effects, at least one study 
shows a statistically significant or substantively important positive effect, and more studies show statistically 
significant or substantively important negative effects than show statistically significant or substantively important 
positive effects. 


Negative effects 


Two or more studies show statistically significant negative effects, at least one of which met WWC evidence 
standards for a strong design, AND 

No studies show statistically significant or substantively important positive effects. 


No discernible effects 


None of the studies shows a statistically significant or substantively important effect, either positive or negative. 


Criteria used to determine the extent of evidence for an intervention 


Extent of evidence 


Criteria 


Medium to large 


The domain includes more than one study, AND 
The domain includes more than one school, AND 

The domain findings are based on a total sample size of at least 350 students, OR, assuming 25 students in a class, 
a total of at least 14 classrooms across studies. 


Small 


The domain includes only one study, OR 
The domain includes only one school, OR 

The domain findings are based on a total sample size of fewer than 350 students, AND, assuming 25 students 
in a class, a total of fewer than 14 classrooms across studies. 
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Glossary of Terms 

Attrition 

Clustering adjustment 
Confounding factor 

Design 
Domain 
Effect size 

Eligibility 

Equivalence 

Extent of evidence 

Improvement index 

Multiple comparison 
adjustment 

Quasi-experimental 
design (QED) 

Randomized controlled 
trial (RCT) 

Rating of effectiveness 

Single-case design 
Standard deviation 



Statistical significance 



Substantively important 

Please 



Attrition occurs when an outcome variable is not available for all participants initially assigned 
to the intervention and comparison groups. The WWC considers the total attrition rate and 
the difference in attrition rates across groups within a study. 

If treatment assignment is made at a cluster level and the analysis is conducted at the student 
level, the WWC will adjust the statistical significance to account for this mismatch, if necessary. 

A confounding factor is a component of a study that is completely aligned with one of the 
study conditions, making it impossible to separate how much of the observed effect was 
due to the intervention and how much was due to the factor. 

The design of a study is the method by which intervention and comparison groups were assigned. 
A domain is a group of closely related outcomes. 

The effect size is a measure of the magnitude of an effect. The WWC uses a standardized 
measure to facilitate comparisons across studies and outcomes. 

A study is eligible for review and inclusion in this report if it falls within the scope of the 
review protocol and uses either an experimental or matched comparison group design. 

A demonstration that the analysis sample groups are similar on observed characteristics 
defined in the review area protocol. 

An indication of how much evidence supports the findings. The criteria for the extent 
of evidence levels are given in the WWC Rating Criteria on p. 26. 

Along a percentile distribution of students, the improvement index represents the gain 
or loss of the average student due to the intervention. As the average student starts at 
the 50th percentile, the measure ranges from -50 to +50. 

When a study includes multiple outcomes or comparison groups, the WWC will adjust 
the statistical significance to account for the multiple comparisons, if necessary. 

A quasi-experimental design (QED) is a research design in which subjects are assigned 
to treatment and comparison groups through a process that is not random. 

A randomized controlled trial (RCT) is an experiment in which investigators randomly assign 
eligible participants into treatment and comparison groups. 

The WWC rates the effects of an intervention in each domain based on the quality of the 
research design and the magnitude, statistical significance, and consistency in findings. The 
criteria for the ratings of effectiveness are given in the WWC Rating Criteria on p. 26. 

A research approach in which an outcome variable is measured repeatedly within and 
across different conditions that are defined by the presence or absence of an intervention. 

The standard deviation of a measure shows how much variation exists across observations 
in the sample. A low standard deviation indicates that the observations in the sample tend 
to be very close to the mean; a high standard deviation indicates that the observations in 
the sample tend to be spread out over a large range of values. 

Statistical significance is the probability that the difference between groups is a result of 
chance rather than a real difference between the groups. The WWC labels a finding statistically 
significant if the likelihood that the difference is due to chance is less than 5% (p < 0.05). 

A substantively important finding is one that has an effect size of 0.25 or greater, regardless 
of statistical significance. 

see the WWC Procedures and Standards Handbook (version 2.1) for additional details. 
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